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SUMMARY 


Tfe“aBBwr  the  feasibility  of  a  goodness -of -fit  test  based  on  an  intergral  of  the 
weighted  squared  modulus  of  the  discrepancy  between  the  sample  and  population 
characteristic  functions .  The  resulting  statistic  is  therefore  analogous  to  the 
Cramer -von  Mi see  statistic  and  is  shown  to  reduce  to  it  as  a  special  case.  A  number 
of  properties  of  the  test  have  been  derived,  including  the  asymptotic  null  distribution 
of  its  statistic.  It  is  shown  that  under  mild  regularity  conditions  the  test  is 
consistent.  A  number  of  approximations  to  the  null  distribution  of  the  test  statistic 
are  considered,  and  are  found  to  be  successful  in  simplifying  its  application  without 
due  loss  of  accuracy. 

Keywords i  sample  characteristic  function,  goodness -of -fit, 

•*  weighted  sum  of  chi-squared  variates1,  consistency, 
asymptotic  distribution;  rate  of  convergence;  cumulants,  - 


approximate  distribution. 


1.  INTRODUCTION 


The  characteristic  function  corresponding  to  any  distribution  function  F(x)  is 
defined  by 


<Xu) 


dF(x) 


(l.D 


where  1*  =  -1,  is  uniquely  associated  with  r  (Lukacs,  1970,  p.  28).  For  many 
purposes  it  is  more  convenient  to  work  in  this  transform  space  rather  than  dealing 
directly  with  distribution  functions,  as  a  simple  and  well  known  example,  the  addition 
of  independent  random  variables  corresponds  to  a  multiplication  of  characteristic 
functions.  Similarly,  the  behavior  of  characteristic  functions  under  location  and  scale 
shifts  is  particularly  simple. 

If  Xj. ,  x2 , . . . ,  Xn  is  a  sample  consisting  of  independent  and  identically 
distributed  random  variables  with  distribution  function  P(x),  then  an  empirical 
characteristic  function  $n(u)  may  be  defined  as 


in(u)  =  [  e1  ux  dPn(x)  «  ±  E  « 
J  -®  n  J  *x 


(1.2) 


and  <t>n(u)  completely  specifies  the  sample  Xt,  X2 , . . . ,  xn  up  to  permutations.  In 
addition  it  follows  almost  by  definition  that,  for  each  fixed  u,  E(e1uXJ)  =  $(u),  so 
that  <t>n(u)  is  unbiased  for  $(u).  Thus  by  the  strong  law  of  large  numbers  (Rao, 
1965,  p.  97)  in(u)  is  a  strongly  consistent  estimator  of  $(u)  as  well  as  being 
unbiased. 

The  potential  applicability  of  sample  characteristic  functions  in  the  inference 
setting  of  tests  of  goodness  of  fit  has  been  Investigated  by  Heathcote  (1972),  Paulson 
and  Thornton  (1975),  Feurverger  and  Murelka  (1977),  Koutrouvelis  (1980), 
Koutrouvells  and  Kellermeler  (1981),  Murota  and  Takeuchi  (1981),  Ceorgo  and 


Heathcote  (1982),  Bryant  and  Paulaon  (1982),  Ball  and  Welsh  (1983),  Heathcote 


(1982),  Epps  and  Pulley  (1983),  and  Csorgo  (1986).  Further  reference*  and  a 
review  are  given  by  csorgo  (1984). 

This  paper  make*  uae  of  the  goodness  of  fit  statistic 

nan  *  n  T  lin(u)  -  8o(u)l*  <Mu),  (1.3) 

j  -a 

where  w(u)  is  a  given  weighting  function.  This  staltstic  is  clearly  analogous  in  fora 
to  the  Cramer -von  Mines  statistic.  There  will  be  situations  in  which  it  is  more  natural 
to  use  (1.3)  than  the  more  standard  empirical  distribution  function  (EDP)  statistics. 
For  example,  the  existence  of  a  viable  characteristic  function -oriented  procedure 
would  be  of  both  theorectical  and  computational  value  in  tests  of  hypotheses  concerning 
the  stable  distributions,  some  compound  distributions,  or  in  cases  where  under  the  null 
hypothesis  the  population  distribution  is  neither  purely  descrete  nor  absolutely 
continuous .  Furthermore ,  most  of  our  results  are  essentially  Independent  of  the 
dimension  of  the  population  under  investigation  and  will  be  directly  applicable  to 
problems  of  multivariate  goodness  of  fit.  The  multivariate  case  could  be  far  more 
Important  than  the  univariate  case  because  the  uniform  continuity  of  «o(u)  and  8n(u) 
permits  simplification  in  analysis  of  the  test  statistics  which  will  not  be  available  to 
multivariate  EDP  statistics. 


2.  A  GOODNESS  -OP-FIT  TEST 

The  problem  to  be  considered  here  is  the  construction  of  a  procedure,  based  on 
the  empirical  characteristic  function,  whereby  the  goodness -of -fit  hypothesis 

Bq>  F  ■  Fo  or  Hqj  d(u)  m  $o( u )  (2.1) 

may  be  tested  against  the  general  alternative  hypothesis 
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■AwW.'/v-,'.'* 


P  p  Pq  or  “i*  $(u)  p  $q(u>‘ 


(2.2) 


Here  P  1 a  the  distribution  function  of  the  population  in  question  and  Pq  is  completely 
specified}  the  corresponding  characteristic  functions  are  <t>  and  <fc0  respectively.  The 
formulation  of  hypotheses  in  terms  of  <p  and  will  be  more  natural  for  our  purposes 
for  we  will  be  working  for  the  most  part  in  the  transform  space.  The  test  procedure 
is  based  on  the  statistic  (1.3)  where  $n(u)  is  the  empirical  characteristic  function 
computed  from  a  random  sample  of  sire  n,  Xlt  Xz, ,  xn,  drawn  from  the 
population.  The  weighting  function  w(u)  in  (1.3)  is  a  given  distribution  function, 
nondecreasing,  continuous  from  the  right  and  bounded,  with  w(  -®)  =  o,  w(»)  =  0  >  0. 
The  value  of  the  constant  0  is  immaterial  and  will  generally  be  taken  to  be  unity. 
Since  |$n(u)|  and  |$0(u)|  are  bounded  above  by  l,  the  integral  in  equation  (1.3) 
must  converge.  We  will  usually  refer  to  w  as  a  "weighting  function"  to  avoid  confusion 
with  the  distribution  functions  P  and  Pq  . 

The  consistency  of  the  test  of  null  hypothesis  based  on  n*n  is  baaed  on  two 
results  of  Bryant  and  Paulson  (1979). 

Lemma  2.1 

The  quantity  l$n(u)  *  $(u)|*  dw(u),  where  in(u)  Is  the  empirical 

J  -op 

characteristic  function  based  on  a  random  sample  of  size  n  drawn  from  the  distribution 
whose  characteristic  function  is  $(u) ,  has  mean  and  variance  given  by 

E  j*  lin(u)  *  *u)|*  dw(  u )  =  i  J*  (1  -  IdKu )  I  * )  dw(u)  (2.4) 


var  [*  l$n(u)  ‘  *(u)l*  dw(u)  * 

J  -« 

2-%j3)  tj"j*<u>l2  <*w(u)}4  +  ^  J*®  f^tWu+t)!*  +  l*u-t)|*}  dv(u)dv(t) 
— ■  |*  |*  Re{<Xu+t)<D*(u)4>«<t)  +  $(  u-t  )$*(  u  )4K  t ) )  dw(u)dw(t).  (2.5) 


Let  nan  be  denned  as  in  equation  (1.3)  where  $n(u)  is  based  on  a  random 
sample  of  else  n  from  the  population  whose  characteristic  function  is  <$(u) .  Then  an 

converges  to  the  quantity  [  !$(u)  -  $0(u)|a  dw(u)  with  probability  one. 

Since  w(u)  is  strictly  increasing  it  places  positive  mass  on  every  nondegenerate 

u-lnterval  and  this  implies  that  the  ]  l<Ku)  -  $0(u)|z  dw(u)  is  zero  if,  and 

only  if,  F(x)  =  P0(z).  By  Theorem  4.1  nan  converges  in  law  to  a  distribution 
function  G .  Thus  the  goodness -of -fit  test  based  on  nan  will  have  a  rejection  rule  of 
the  form 

Reject  H0  if  and  only  if  nan  >  ct" 

where  ca  is  a  critical  value  chosen  so  that  asymptotically  the  size  of  the  teat  is  fixed 
at  a. 

If  r  /  P0 ,  some  x,  than  an  will  converge  strongly  to  a  constant  other  than  zero 
as  n  ♦  os  so  that  nan  will  almost  surely  diverge  up  to  as.  This  argument  is  summarized 
as 

Theorem  2.1  The  goodness -of -fit  test  of  the  hypothesis  Bo:  P  »  Fq  based  on  the 
statistic  an  of  (1.3),  where  the  weighting  function  w(u)  is  strictly  increasing,  is 
consistent;  that  is,  for  any  alternate  distribution  F  ft  Fq,  the  power  of  the  test 
approaches  one  as 
n  ■*  ®. 

Tests  of  fit  based  on  only  a  finite  number  of  u -values  cannot  be  consistent  since 
the  values  taken  on  by  a  characteristic  function  at  a  finite  number  of  u-  values  do  not 
characterize  a  distribution  function.  The  assumption  in  Theorem  2.1  that  w(u)  be 
strictly  increasing  may  be  relaxed  when  the  population  of  interest  poses  sees  an 


analytic  characteristic  function  for  thsn  a  characteristic  function ,  and  tones  its 
distribution  function  is  uniquely  determined  by  its  values  over  any  non -degenerate 


m 


u- interval  (Lukacs,  1970,  Chapter  7). 

The  statistic  an  is  similar  in  structure  and  actually  reduces  as  a  special  case  to 
the  Cramer -von  Mlses  statistic 


-  ["  (rn<*)  -  Po(*))*  dFo(*) 


(2.6) 


for  testing  Hqi  F  =  Fa ,  where  F0(x)  is  the  completely  specified  distribution  function 
and  F„(x)  is  the  sample  distribution  function  corresponding  to  the  random  sample  Xx, 
Xz , . . . ,  xn  putatively  drawn  from  Fq(x).  we  now  sketch  the  proof  of  this  assertion. 

on  the  Hilbert  space  L*C0,ll  of  all  square -lntegrable  complex  valued  functions 
on  to,l)  the  Inner  product  of  tiro  functions  h  and  g  is  given  by  (f,  g )  = 

f  f(t)g*(t)dt.  The  set  of  functions  (el2,rkt,  k  =  o,l,...)  is  a  complete 
JO 

orthonormal  system  on  l2C0,1J. 

By  means  of  the  probability  integral  transformation,  (2.6)  is  reduced  to  the  form 


=  \X  *S(t)  dt  =  I  Ig„n* 


(2.7) 


where  gn(t)  =  Gn(t)  -  t  and  Gn(t)  is  the  sample  distribution  function  corresponding 


to  the  transformed  sample  Tx,  Y2, . . . ,  Yn .  By  determining  the  Fourier  coefficient  of 
gn(t)  with  respect  to  el2,rkt  and  utilizing  Parseval's  equality  in  (2.7)  it  may  be 
shown  that 


11m  ,cn(u>  ■  c0(u)|«  lcn(  -2rrk)  -  c0(-2irk)l* 

u*0  u  k=  -®  4rr*X* 

MO 


f*  lcn(u)  -  c0(u)|*  dw(u), 
J  -00 


(2.8) 


(2.9) 


where  Cg( u )  la  the  charactarlatlc  function  of  a  random  variable  on  CO.ll  and 
cn(u)  la  tha  empirical  charactarlatlc  function  aaaociatad  with  the  Yj  and  w(u) 
la  a  atap  function  whose  definition  la  clear  from  aquation  (2.8).  Tha  Cramer -von 
Klees  statistic  la  thus  a  special  case  of  na* . 

3.  ASYMPTOTIC  DISTRIBUTION  OP  THE  STATISTIC  na^P> 

Me  first  consider  tha  distribution  of 

P  m 

naf,P>  =  n  E  l$n<Uk)  *  <«Uk>l*  *(Uk>.  (3.1) 

k=i 

a  discrete  version  of  the  nan  of  equation  (1.3).  The  uk  in  (3.1)  aura  pre-selected 
abscissae  and  the  wk  =  w(uk )  are  given  non-negative  weights.  There  are  at  least  two 
good  reasons  for  considering  .  First,  it  will  turn  out  that  the  asymptotic 

distribution  of  nr.n  may  be  obtained  in  essentially  the  same  manner  as  that  of  na^P)  of 

(3.1) .  Thus  the  simplified  problem  will  serve  to  motivate  later  considerations. 
Secondly,  there  are  situations  of  practical  Interest  where  <t>(u)  and  w(u)  are  such  that 
the  integral  in  equation  (1.3)  cannot  be  explicitly  evaluated .  if,  for  example,  the 
desired  weighting  function  were  dw(u)  =  exp(-u*)du,  then  in  equation  (3.1)  we  might 
choose  the  uk  and  wk  to  be  the  abscissae  and  weights  associated  with  the  Heraitian 
quadrature  of  order  p  (Stroud  and  Secrlst,  1966,  p.  217). 

we  shall  always  take  p  to  be  even  and  uk  =  -up.k+1,  wk  «  wp  _k+1 ,  k=i,2, . . .  ,p. 
Since  wk  =  wp.k+x  and  (Re  s  -  Re  S)(lm  i  -  Im  *)  1s  an  odd  function  we  may  write 

(3.1)  as 


na£P>  =  n  E  (y„( u) 

k=i 


where  y(u)  is  the  transform,  y(u) 


..  .  -r. -f.  ' .  •  .  •  .  \  ...  .  .  -r  . J  ...  ,-,v 


Y(  u))1  wk.  (3.2) 

Re  S(u)  f  ime(u),  and  yn(u)  is  its  sample 
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*  _ »A.  —A  -A 


■-.v\ 


counterpart,  yn(u)  *  R*  $n(u)  +  ™  $n(u)* 


So«e  rearrangement  gives 


naf,p> 


0.3) 


where 


tn  -  n'^E^  gj .  £j  =  (*!.)•  ezjf‘‘»  ®pj)T* 


and 


ekj  =  (cos  ukXj  +  sin  ukXj  ■  Re  «uk)  •  !•  <Kuk>)  wjjf. 


The  pxl  vectors  Sj  are  independently  and  identically  distributed  with  mean 
vector  o  and  pxp  covariance  matrix  M  =  (m3k  )  given  by 

mJk  =  X(uj,uk)(wJwk)%  0.4] 

where,  as  is  easily  shown, 

K(u,v)  =  Re  <Xu-v)  +  im  4*u+v)  -  CRe  4Ku)  +  Im  <Xu)lCRe  ^v)  +  xb  V)1 
=  n  cov  (yn(u),  yn(v)>. 

Since  M  is  a  covariance  matrix  it  is  symmetric  and  positive  semi -definite  of 
rank  r  a  p.  We  may  therefore  extract  p  real  orthonormal  eigenvectors  cq< 

*1  =  1,2,...,  p,  which  span  p -dimensional  Euclidean  space  and  satisfy 

c{  M  £k  =  *q«qk,  i  a  q,  k  a  p  (3.5 


ldtere  8qk  is  Kronecker's  delta  and  Ax,  Az,...,  Ap  are  the  non-negative 
eigenvalues  of  M  which  are  arranged  so  that  Az,  Az,...,  Ar  are  positive  While 
Ap+1  *  ...  =  Ap  =o.  Referring  the  inner  product  of  equation  (3.3)  to  the 
orthonormal  basis  (cl ,  cz,...,  cp)  gives 


(3.6) 


"4P)  %§*<£« 


since  the  random  variables  c ^  tn  are  degenerate  at  the  origin  with  prob¬ 
ability  one  for  q  >  r.  Proa  (3.6)  and  the  definition  of  t„. 


r  .  n 

"4P)  =  E  A  J  -  E  *qj 

q= i  i  nil  3=1 


(3.7) 


where 


£q  S3 


#  =  1»2 j  #  #  •  f  rj  J  =  1  y 2 1 « <  •  f  n « 


The  rxl  vectors  zj  =  (*xJ ,  zzj , . . . ,  zri  )T,  J  =  1,2, ....  n,  are  Independent 

and  identically  distributed  with  mean  vector  o  and,  froa  (3.5),  covariance 

.  " 

matrix  I.  since  n'*  E  converges  in  law  to  Nr(o,  I)  (Rao,  1965,  p.  108) 
~  J=i 

we  find  that  naf,P)  is  asymptotically  distributed  as 


E  *q  Xq i 
q=i 


(3.8) 


where  Xq  are  Independently  and  identically  xz  distributed  on  one  degree  of  freedom. 
The  characteristic  function  of  na{,p)  is 


c(p)(u)  = 


exp(  \  E  tan'1  (2Aqu))  r 

- Efi -  ,  I  E  tan*1  2Xqu|  <  ~  , 

r  q=l  2 

n  (l+4Ai  U*)* 
q=i 


(3.9) 


=  {  n  ( l-2Aqiu  )}^ 
q=i 

=  II  -  2uiMI*'« 


(3.10) 


(3.11) 


The  proper  branch  of  the  square  root  in  (3.10)  is  specified  by  (3.9). 

_  p  . 

Equation  (3.11)  may  be  determined  directly  by  observing  that  n  E  (yn(u)  * 

3*i 
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y(u))2  la  distributed  asymptotically  as  a  sub  of  correlated  x2  variates 
(Lukacs  and  Laha,  1964,  p.  40-43).  m  have  thus  proven 


Theorem  3.1 

The  goodness -of -fit  statistic  na^P)  defined  by  (3.1)  has,  when  the  null 
hypothesis  H0  is  correctly  specified,  the  asymptotic  distribution  whose  characteristic 


function  is 


r 

C<P>(u)  =  {  ni(l-2AqiU))'%  =  II  -  2ulMI  "v 


(3.12) 


where  M  is  the  covariance  matrix  whose  elements  are  given  by  (3.4), 

«w» 

ax,a2,...«  Ar  are  the  positive  eigenvalues  of  N  and  r^p  is  the  rank  of  N. 

The  determinant  in  (3.12)  may  be  expanded  ( PogorzelsW. ,  1966,  pp.  31-32)  in 
terms  of  the  covariance  kernel  and  the  weights  Wj  as 

l£  '  A  MI-%  =  l-A  £  K(  Uj  ,u  j  )Wj  +  £  E  E  C(uJ,uk)wJwk 
3-x  21  J  k=x 

/  -A )P  P  P  P 

£  ...  £  C(u«  ,U«  . Uj  )  n  Vj  (3.13) 

pi  Jxal  j1=l  J 1  J 1  Jp  1=1  Ji 


where  we  have  set  A  =  2ul  and 


ug)  * 


K(ux,ux)  .  .  .  K(ulfug) 


^(Ug.Ui)  .  .  .  K(Ug,u, ) 


(3.14) 


Equations  (3.12)  and  (3.13)  are  suggestive  of  the  results  that  we  may  expect  when 
we  let  p  (as  well  as  r)  tend  to  infinity  in  such  a  way  that  (3.1)  becomes  the  integral 
expression  (1.3). 


mmm 


4. 


THE  ASYMPTOTIC  DISTRIBUTION  OF  nan 


This  Motion  contains  our  main  mult  concerning  the  limiting  distribution  of  the 
test  statistic.  some  preliminary  lemmas  are  stated  without  proofs.  The  main 
reference  for  this  section  (and  Section  6)  is  Dunford  and  Schwarts  (1958). 

Define  the  function 


Sj(u)  *  cos  uXj  +  sin  uXj  -  Re  $(u)  -  Im  u )  (4.1) 

for  J  =  1,2, ...  •  n.  These  processes  are  independently  and  identically  distributed 
with  mean  function  E(Sj(u))  =  0  and  continuous  covariance  kernel  K(u,v)  given  at 
(3.4).  Let  tn(u)  be  defined  by 


tn(u)  =  n*1*  ^«j(u)  =  n%  (yn(u)  -  y(u)) 


(4-. 2) 


so  that 


na- 


-  r. ts<u 


X*w(u), 


(4.3) 


the  squared  norm  of  tn(u)  in  L2(w),  the  Hilbert  space  of  functions  square  integrable 
with  respect  to  w(u).  We  shall  employ  Parseval’s  equality  to  expand  nan  of  (4.3)  in 
terms  of  a  complete  orthonormal  system  In  L*(w)  consisting  at  least  partially  of  the 
eigenfunctions  of  the  integral  operator 

K  y(u)  *  f*  K(u,v)y(u)dw(v).  (4.4) 


First  we  need  some  results  concerning  this  operator. 

Lemma  4.1 

There  exists  a  finite  or  countably  infinite  orthonormal  system  of  eigenfunctions 


fq(u) ,  q  =  i,2,...,  of  the  integral  operator  K  of  (4.4),  where  fq(u)  is  associated 
with  the  eigenvalue  ,  which  together  with  an  at  most  countable  set  of  functions  nq(  v) 


j®JC(u,v)  f)q( v )  dv(v)  =  0  a.e.  (w),  q  =  1,2,..., 


satisfying 


fora  a  complete  orthonormal  system  in  L2( w) . 

These  assertions  follow  from  the  theory  of  linear  operators  of  the  Hilbert -Schmidt 
type  in  Hilbert  space  (Dunford  and  Schwartz,  1958,  p.  1012,  and  Schmeidler,  1965, 
p.  57). 

Lemma  4.2 

The  real  and  positive  non -zero  eigenvalues  At,  x2,...,  of  the  self  ad  Joint 
integral  operator  K  of  (4.4),  arranged  in  decreasing  order  and  repeated  according  to 
their  multiplicity  satisfy 

“  xq  =  f®  K(u,u)dw(u)  =  P  (1  -  |<Xu)|*)dw(u)  <  ®. 
q=l  J  -a)  J  -« 


Lemma  4.3 

The  Fredholm  determinant,  defined  by  the  power  series 
®  /  -x  \J  r 

D(A)  =  l  +  C(ux,u2 . u j  )dw( u ) .  . .  dw(uj  ), 

where  C(uL,u2, . . .  ,uj )  is  defined  in  (3.14),  is  absolutely  convergent  for  all  complex 
A  and  may  be  expressed  as  the  infinite  product 

D( A )  =  n  ( 1  - AqA ) , 
q»l 


4  "  • 


V.V. 


<*.- 


the  latter  converging  absolutely  amd  uniformly  for  all  A  in  any  bounded  region  of  the 
complex  plane. 

Because  the  kernel  K(u,v)  is  real,  the  real  and  imaginary  parts  of  any 
eigenfunction  fq(u)  associated  with  the  eigenvalue  Aq  are  also  eigenfunctions 
associated  with  Aq.  The  complete  orthonormal  basis  (fq(u),  r)q(u),  q  =  1,2,...)  of 
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t—a  4.1  nay  therefore  be  aaauned  to  be  compossd  of  real  functions,  and  for 
simplicity  we  will  take  then  as  such. 

Our  primary  result  will  be  given;  In  the  proof  we  will  assume  that  the  number  of 
eigenvalues  xq  Is  infinite.  If  this  Is  not  the  case  (as  in  Theorem  3.1)  the  required 
alterations  will  be  obvious. 

Theorem  4.1 

The  goodness -of -fit  statistic  nan  of  equation  (1.3)  has,  when  the  null  hypothesis 
Bo  is  correctly  specified,  the  asymptotic  distribution  whose  characteristic  function 
c(u)  is 

OD 

c(u)  =  n  ( l-2Aqiu)"^  «  D‘*(2ui) 

<J=i 

where  Ax,a2,...,  are  the  positive  eigenvalues  of  the  integral  operator  given  by 
equation  (4.4)  with  kernel  K(u,v)  defined  at  (3.4).  0(A)  is  the  Fredholm 

determinant  associated  with  this  operator,  defined  In  Lemma  4.3. 

Proof 

Using  Parseval's  equality,  we  expand  the  squared  norm  In  equation  (4.3)  with 
respect  to  the  complete  orthonormal  system  (fq(u),  hq(u),  q  =  1,2,...}.  This  gives 

na„  =  E  (tn,  fq)*  +  £  (t„,  n„)*  •  (4.5) 

q=l  q=l 

Now  by  Lemma  3.1  we  have  E(tn(u))  =  0  and  cov(tn(u),  tn(v))  =  K(u,v). 
Therefore,  use  of  Fublni's  theorem  gives,  for  q  =  1,2,..., 

B(tn,  rjq )  =  f(tn»  fq  )  =  0* 

E«tn,  T)q)4}  =  0,  (4.6) 

E«tn,  fq)z)  *  Aq  r  f$(u)  *  Aq.  (4.7) 

}  -aj 
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PS: 


Pro®  equations  (4.6)  and  (4.7)  we  have 

“  E(tn,  fq  )*  +  £  E(tn,  rjq  )*  =  £  Aq 

<5=1  q=l  M  q=l 

which  we  have  shown  to  be  finite.  Accordingly  ( Rao ,  1965,  p.  91),  the  sum  in 

OD 

equation  (4.5)  is  convergent  with  probability  one  and  E(nan)  =  E  A0. 

q=l 

Each  of  the  terms  (tn,  T)q)z  in  equation  (4.5)  is,  with  probability  one,  degenerate  at 
the  origin.  Since  there  are  at  aoet  a  countable  number  of  such  terms,  we  have  that 
with  probability  one  nan  may  be  expressed  as 


nan  =  E  ( tft,  fq)*. 
q=l 


(4.8) 


As  in  the  proof  of  Theorem  3.1,  let 


*«,)  •  <*)‘  f,)  •  1  *  3  *  *•* .  ». 


so  that 


nAn  =  q=i  y  ]£i  *«•'] 


(4.9) 


Ihus,  E( Zq j )  =  0  and 

Cov( Zq j ,  *q •  j  )  =  fq(u)  [  J*  K(u,v)fq .(  v)dw(  v)]  dw(u) 


=  — -  [*  fq(u)fq.(u)dw(u)  =  sqq.  . 

<AqAq')%  * 


If  we  now  define  the  truncated  sums 


n4r>  =  E  (tn,  fq)*  =  E  Aq(n'%  E  *qJ  )* 

q=l  J=1 


9*1 


(4.10) 


r  =  1,2,...,  it  follows  by  Theorem  3.1  that  naf,r )  is  asymptotically 
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r 

distributed  as  q( r )  =  E  Aa  Xq •  a  weighted  sub  of  Independent  x*  variates 

q=l 

one  degree  of  freedom,  with  characteristic  function  c<r>(u)  given  by  (3.10)  with  p=r . 

Let  Pn  be  a  sequence  of  distribution  function  converging  to  a  distribution  function 
P  on  the  continuity  set  of  F .  This  mode  of  convergence  is  denoted  by  Pn  =>  p.  Let 
Gn  be  the  distribution  function  of  nan,  G^r)  the  distribution  function  of  na^r)  and 
G<r)  the  distribution  function  of  Q(r)  whose  characteristic  function  is  c<r)(u).  We 
have  shown  that  G^r)  =>  G(r>  for  fixed  r.  An  application  of  the  continuity  theorem 
( Lukacs ,  1970,  pp.  48-50)  and  Lemma  4.3  implies  that  G^r)  =>  G . 

The  proof  Is  completed  by  constructing  a  sequence  (rn,  n  =  1,2,...}  which 

diverges  monotonies lly  up  to  +a>  and  showing  that  Gnn  =>  g  Which  will  imply 
that  Gn  =>  G  (Rao,  1965,  pp.  100-102).  Since  the  distribution  functions  G<r)(x) 
are  continuous,  we  have  by  Polya's  theorem  (Rao,  1965,  p.  100) 

llm  sup  |G^r*(x)  -  cKr>(x)|  =  o. 

rvxx>  x 

Bence  we  can  define  integers  qp  by  qp  =  least  Integer  >  qp_i  such  that 

|Gn(x)<p)  G<p)(x)l  <  2~p  for  all  n  a  qp,  with  q  =  1,2,...,  qj  *  0,  Me  than 
take 

rn  *  p  for  n  «  qp  +  1,  qp  +  2, . . . ,  qp+1, 
n  —  1 , 2 , . .  • ,  p  —  1,2,... 

Then  for  any  x  and  p 

ia{,r°  }(x)  -  G<P)(x)|  <  ,  n  =  qp+l,  qp+2 . q(p+i>  (4.11) 

Let  «>0  be  given,  and  let  x  be  any  point  of  G.  Since  G<r)  =>  G,  there 
exists  an  r*  such  that 
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1  d 

without  loss  of  generality,  take  r*  large  enough  so  that  ——  <  -  . 

2 

Let  n*  =  qr r+l.  Then  for  any  nan*,  there  exits  an  t  a  r*  such  that 
q?  +  lanaq^  r+i ) •  For  such  n  we  have,  by  equation  (  ) 

|G},rn)(x)-G<?)(x)|  <  4-  *  -75  <  - 

2 r  2r* 

while  from  equation  (4.12), 

IG<?>(x)  -  G(x)|  <  |  . 

Thus 

|Gi,rn)(X)  -  G(  X)  I  *  |G{,rn)(X)  -  G<*>(X)|  +  |G<*>(X)  -  G(X)|  <  e 

for  all  n  a  n* .  It  follows  that  G„rn^  =>  G,  which  in  turn  Implies  that 
Gn  =>  G,  and  so  the  theorem  is  proven. 

The  correspondence  between  Theorems  3.1  and  4.1  is  clear.  Although  it  was 
assumed  in  Theorems  3.1  and  4.1  that  the  underlying  population  was  univariate,  the 
proof  would  still  go  through  with  only  notatlonal  changes  if  the  populations  were 
multivariate.  Note  that  the  function  w(u)  need  not  be  absolutely  continuous  and  the 
integral  in  the  definition  of  the  nan  need  not  be  restricted  to  a  finite  support.  For 
example,  the  symmetry  statistic  nTn  of  Peuerverger  and  Mureika  (1977)  is  of  the  same 
general  form  as  nan  and  Theorem  3 .  l  may  be  used  to  derive  the  asymptotic 
distribution  of  this  statistic  under  the  null  hypothesis  of  population  symmetry. 

5.  SPECIAL  CASES  AND  EXAMPLES 

The  results  of  Theorem  4.1  will  be  illustrated  in  this  section  by  applying  them  in 


several  specific  examples. 

(a)  suppose  the  hypothesised  distribution  Is  discrete  and  that  under  Hg  the 
population  random  variable  may  take  on  the  values  xx,  x2 . xa  with  positive 


probabilities  plt  p2 


Ps. 


E  Pq 
q*i 


1 .  Then  Re  ♦<  u )  =  £  Pa  cos  ( ux. )  and 

q»l  M 


Im*(u) 


s 

*^E^Pq  sin(uxq). 


A  little  algebra  will  reduce  the  kernel  K( u , v )  at 


equation  (3.4)  to 


K(u,v) 


■ 

E^tq(u)fq(v)Pq  , 


Where 


tq(u)  *  cos( uxq )  +  ein(uxq)  -  Re$<u)  -  lm*(u). 

Thus  the  kernel  Is,  In  this  case,  degenerate  and  it  may  be  shown  by  an  argument 
to  that  given  by  Smithies  (1950,  pp.  36-30)  that  the  nonzero  eigenvalues  of 
K(u,v)  are  precisely  those  of  the  matrix  product  AP,  where  the  (J,k)th  element 

of  A  is 

MS 

*jk  =  r  gj(u)g*<u)dw(u)  l  *  J,  k  *  a, 

J  -flo 

and  P  s  dlag  (Px,  P*,...,  P«}. 


It  Is  seen  that  the  situation  here  is  similar  to  the  one  considered  In  Theorem 

3.1.  The  asymptotic  distribution  of  nAn  is  that  of  a  finite  sum  of  Independent 

weighted  chi-squared  variates,  the  weights  being  the  nonzero  eigenvalues  of 
AP. 

The  case  described  above  applies  whenever  data  have  been  grouped}  It  also 

indicates  a  possible  approximation  method  to  be  used  when  the  eigenvalues  of  the 


kernel  K(u,v)  are  not  easily  obtained. 


(b)  Support  that  under  the  null  hypotheele  the  underlying  population  has  a 


Cauchy  distribution  with  location  and  scale  parameters  8  and  a  respectively,  so 
the  population  characteristic  function  is  *(u)  =  exp(l8u  -  c|u|). 

Suppose  the  weight  function 


w(u)  =  a  [  e'a,tldt 

J  -CD 


is  chosen,  where  a  is  a  given  positive  constant.  Then  na^  may  be  explicitly 
Integrated  and  Is  given  by 


pa  -  i  ?  2  _  I  tn+r/a)  .  _2n_ 

"  »  J,W  A  ( l+cr/a )*  +  1  +  J 


(5.1) 


tSiere  XlfX2,...,  x„  is  the  random  sample  drawn  from  the  population.  The 
Xi  -B 

variates  _  have,  if  the  null  hypothesis  Is  true,  the  Cauchy  distribution 

a 

with  location  parameter  zero  and  scale  parameter  y  =  -  .  Therefore,  the 

a 

null  distribution  of  (5.1)  depends  only  on  y,  which  we  will  here  take  to  be 
unity  for  simplicity.  We  may  thus  assume  that  $(u)  =  e-,u|  and  w(u)  = 

fU 

j  e'^ldt.  Then  the  covariance  kernel  K(u,v)  (see  3.4)  is  equal  to 


X(u,v)  =  e-lu-vl  .  e-lul-lvl  ' 


(5.2) 


It  may  be  shown  that  the  asymptotic  characteristic  function  of  nan  is  given  by 


c(u)  =  oHi(2ui)  =  n  fi  -  iijH] 
s=l L  Jia J 


(5.3) 


"here  jx9f  s  *  1,2,...,  are  the  positive  zeros  of  J,(u) ,  the  Bessel  function  of  the 
first  kind  and  order  2.  Equation  (5.3)  characterizes  the  asymptotic  distribution  of 
nan  as  that  of  an  infinite  sum  of  exponential  deviates  having  means  16/jf  9 .  The 
zeros  Jzt  have  been  tabulated  for  l  <  i  <  zo  in  Abramowltz  and  stegun  (1970, 


(c)  The  ability  to  analytically  obtain  the  eigenvalues  associated  with  the 

asymptotic  distribution  of  nay,,  as  In  the  previous  example,  is  unfortunately  rather 
unusual.  A  more  typical  sltautlon  is  illustrated  by  assuming  that  the  hypothesised 
distribution  Is  normal  with  mean  s  and  variance  a2 .  The  postulated  population 
characteristic  function  is  then  #(u)  =  exp( iSu  -  Vr*u2).  If  dw(u)  = 
a  exp(-a*u2)  du,  a  >  o.  Is  chosen,  then  equation  (1.3)  may  be  Integrated  to  yield 

na„  =  ^  i  J=i  exp^Xj-Xk)2/^*)  -  -(Xj  -8  )*/(  2o2+4a2  ) ) 


nj/jror 
(or*  +  a*)\ 


(5.4) 


It  Is  possible  to  obtain  the  asymptotic  characteristic  function  of  the  statistic  nan 
of  equation  (5.4)  under  a  correctly  specified  null  hypothesis.  As  an  example,  when 
a  is  chosen  equal  to  a  the  asymptotic  characteristic  function  c(u)  of  nay,  may  be 
shown  to  be  equal  to 


*  (l-2bg0iu}Hl  (l+2diu  *  [”*]  (-f )B/(l-2bgz*iu))~^  (5.5) 

q=0  saQl 


where  b,  d,  f  and  g  are  certain  constants  whose  values  are  not  important  to  the 
discussion.  Unfortunately  the  characteristic  function  c(u)  is  not  readily  inverted 
numerically,  and  so  this  result  seems  to  be  of  little  practical  value.  What  are  really 
needed  are  the  values  of  the  weights  associated  with  the  representation  of  the 
asymptotic  distribution  of  nan  as  a  weighted  sum  of  Independent  chi -squared  random 
variables,  and  these  are  not  made  evident  by  formula  5.5. 

Fortunately,  it  turns  out  that  the  asymptotic  distribution  of  nan  In  the  case  where 
the  underlying  population  is  normal  may  be  accurately  approximated  by  the  asymptotic 
distribution  of  a  quadrature -type  sum  of  the  form  considered  in  Theorem  3.x.  Such 


an  approximate  procedure  is  easier  and  more  accurate  than  the  numerical  inversion  of 
the  expression  in  equation  (5.5).  As  in  example  (b)  the  null  distribution  of  (5.4) 
depends  only  on  the  ratio  y  =  a/a,  so  we  may  take  without  loss  of  generality  *(u)  * 

exp( uz )  and  dw( u )  =  exp( -u*  )du  so  that 

nan  =  n  |  lin(u)  *  $(u)|2  e-^2  du.  (5.6) 

The  form  of  equation  (5.6)  suggests  that  the  asymptotic  distribution  of  nan  be 
approximated  by  that  of  (3.1)  where  uk  and  wk,  k  =  1,2,...,  p,  are  respectively 
the  zeros  of  the  Hermlte  polynomial  of  (even)  order  p  and  the  associated  Heraitian 
quadrature  weights.  These  have  been  extensively  tabulated  by  Stroud  and  Secrlst 
(1966,  p.  217).  The  asymptotic  characteristic  function  of  na^p)  Is  given  In  Theorem 
3.1,  where  x£p),  q  =  1,2,...,  p,  are  the  eigenvalues  of  the  pxp  matrix  M<p)  whose 
(q,  q')th  element  Is,  from  (3.4) 

•<;> .  i'>v* .  .->•** (5.7> 

To  aid  In  the  comparison  of  the  distribution  of  nan  and  ,  we  tabulated  the 

values  of  the  first  four  cumulants  of  the  asymptotic  distributions  of  na£p)  and  of  nan 
for  various  values  of  p  and  for  y  ranging  from  to  3.  The  jth  cumulant  of  naj^P)  is 
given  by 

*$p)  *  (J-DI2J-1  C  (A^P))i,  J  =  1,2 .  (5.8) 

q-1 

which  follows  from  the  characterization  of  na^p)  as  a  weighted  sum  of  Independent 
chi  -squared  random  variables  each  with  one  degree  of  freedom,  along  with  the 


additivity  of  cumulants.  The  cumulants  k  j,  j=l ,  2 , . . . ,  of  nan  are  calculated  by 
formulas  which  will  be  provided  in  Section  7.  In  particular  kl  and  kz  can  be 
obtained  by  the  results  of  Lemma  2.1. 
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Our  computations  Indicate  that  the  first  four  cumulante  of  na£p>  reach  their 

asymptotic  values  for  y  =  2  =  %  when  p=12,  for  y*l  when  p=20,  for  y=3/2  when 
Pb36,  and  for  y=3  when  p=64 .  The  cumulante  of  na£p)  are  rapidly  convergent  to 
those  of  nan  if  the  ratio  y  =  a/a  is  not  large.  When  y  is  large  the  kernel  JC(u , v) 

become 8  poorly  behaved  and  is  therefore  difficult  to  approximate  by  M.  For  y 


Kv- 


relatively  small  and  p  sufficiently  large  the  asymptotic  distribution  of  naf,p ) 
provides  a  good  approximation  to  the  distribution  of  nan. 

The  use  of  approximating  sums  of  the  form  given  in  equation  (3.1)  to  determine 
the  asymptotic  distribution  of  nan  is  not  always  as  effective  as  the  preceding  results 
might  lead  one  to  suspect.  Referring  back  to  example  (b)  where  the  hypothesized 
population  was  Cauchy  with  scale  parameter  one  and  the  weighting  function  dw(u)  - 
exp(  - 1  u  |  )du  was  employed  in  the  definition  of  nan,  it  is  reasonable  to  approximate 
nan  by  use  of  a  quadrature  sum  of  the  La  guerre  type  (Stroud  and  Secrlst,  1966,  p. 
2S3 ) .  The  asymptotic  distributions  of  such  sums,  however,  converge  to  that  of  nan 
rather  more  slowly  than  mi  seen  in  the  case  where  the  underlying  population  was 
normal.  This  is  to  be  expected,  since  the  covariance  kernel  K(u,v)  corresponding  to 
the  Cauchy  distribution ,  given  in  equation  (5.2),  is  not  everywhere  differentiable . 
Thus  by  approximating  the  distribution  of  nan  by  that  of  a  statistic  of  the  nature  of 
na^P)  of  equation  (3.1),  we  are  essentially  replacing  the  problem  of  obtaining  the 
positive  eigenvalues  associated  with  a  homogeneous  integral  equation  of  the  form 


Ay(u) 


=  r  k(u, 


v  )y(  v  )dw(  v ) 


(5.9) 


Cv:, 

rv  v 

r-‘/v 


with  the  problem  of  finding  those  corresponding  to  the  linear  system 


*yq  s  K(uq,uq» )w„.yq.  q  *  1,2, 


(5.10) 


-A  ,’ju  aA  -A, .  >A 


I  f  .  ■/'.  w*,.**",  .  w"*’**”h  j" 


.  m ..  ~  t*—  a  a'__  a 
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it  cannot  be  expected  that  the  linear  system  accurately  approximates  the  integral 


equation  unless  the  kernel  K(u,v)  is  smooth. 


6.  INVERSION  OF  THE  CHARACTERISTIC  FUNCTION 


The  distribution  of  the  goodness -of -fit  statistic  nan  Is  given  In  terms  of  Its 
characteristic  function  c(u)  by  Theorem  4.1,  but  in  order  to  apply  this  statistic  to  a 
given  testing  situation,  2m  inversion  of  the  characteristic  function  Is  required  to 
produce  the  necessary  critical  values.  The  problem  of  Inverting  c(u)  is  similar  to 
that  considered  by  Durbin  and  Knott  (1972)  in  their  analysis  of  the  residual  sums  Bp 
of  the  Cremer-von  Mlses  statistic  and  the  method  of  solution  outlined  here  Is 
essentially  Identical  to  their  procedure,  which  is  based  on  the  work  of  Gll-Pelaez 
(1951)  and  Imhoff  (1961). 

The  asymptotic  characteristic  function  of  nan  Is,  from  Theorem  4.1 


os  exp[-  £  tan'1  (2Aqu)| 

c(u)  =  n  (l-2Aalu)-*  =  1  q=1 _ _  .  (6.1) 

q=l  M  cd  " 

n,(l+4Aiu*  )< 
q-l  ^ 


Denoting  by  C(x)  the  asymptotic  distribution  function  of  nan  and  making  the 
substitution  2t  =  u  in  the  Inversion  formula  of  Gll-Pelaez  (1951),  leads  to  the 
approximation  (Imhoff,  1961) 


G(  x ) 


p 

u  n  (l  +  aJ  u*)* 
q=l  M 


du 


(6.2) 


P-1 

trt>ere  Ap  is  adjusted  by  setting  it  equal  to  E( nan  )  -  £  Aq  so  that  a  match 


of  the  expectations  of  the  two  suns  is  effected.  The  integral  in  (6.2)  is  evaluated 
by  Gaussian  quadrature  with  repeated  interval  halving  until  the  error,  based  on 
successive  iterations,  is  acceptably  snail.  Expression  (6.2)  can  be  used  to  compute 
the  exact  values  of  G(x)  »  Prlna^x) . 

7.  APPROXIMATIONS  TO  THE  DISTRIBUTION  OP  nA„ 

The  results  of  Sections  2,  3,  4,  and  5  make  possible  the  application  of  the 
statistic  nan  to  tests  of  goodness  -of  -fit  ;  its  distribution  under  a  correctly  specified 
null  hypothesis  has  been  ascertained  in  terns  of  its  characteristic  function,  and  a 
procedure  by  which  this  characteristic  function  may  be  inverted  has  been  presented, 
so  that  the  critical  values  on  which  the  hypothesis  test  is  based  nay  be  calculated. 
Even  so,  from  a  practical  point  of  view  these  results  are  unsatisfactory)  it  has  been 
seen  that,  in  order  to  obtain  the  characteristic  function  of  nan,  the  eigenvalues  of  a 
certain  integral  operator  must  be  determined,  and  once  this  has  been  done  a  numerical 
integration  is  required  to  perform  the  necessary  inversion.  (This  is  not  the  case  for 
Theorem  3.1.)  Of  course ,  if  the  proposed  testing  procedure  were  to  be  used  only  in 
a  United  number  of  more  or  less  typical  situations  this  would  cause  no  difficulty,  for 
then  tables  of  the  critical  values  of  nan  relevant  to  those  situations  could  be 
prepared.  A  goodness -of -fit  test  based  on  the  empirical  characteristic  function 
should,  however ,  be  applicable  to  a  wide  range  of  problems  precisely  because  it  is  in 
unusual  circumstances ,  where  the  standard  tests  are  not  appUcable  or  else  can  only  be 
employed  with  difficulty,  that  such  a  testing  procedure  would  be  of  greatest  value. 

It  is  clear  that  reasonably  simple  and  yet  accurate  approximations  to  the 
distribution  of  nan  are  requ.red.  It  will  now  be  shown  that  certain  approximations, 
formulated  in  terms  of  the  cumularrts  of  na^,  meet  these  criteria.  Simulation  results 
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presented  In  this  section  indicate  that  the  convergence  of  the  distribution  of  nan  is 
quite  rapid,  implying  that  our  results  are  of  practical  use  for  moderate  sample  sixes. 

The  iterated  kernels  Kj(u,v)  associated  with  the  kernel  K(u,v)  are  recursively 
defined  by  K1(u,v)  =  K(u,v)  and 

Kj(u,v)  =  |  Kj.1(u,t)K(t,v>dw(t),  J  a  2. 

J  -a> 

It  is  shown  in  Ounford  and  Schwartz  (1963,  pp.  1085-7)  that 

0)  =  f"  Kj(u,u)dw(u)  =  E  A$,  for  J  a  1, 
i  -an  q=l 

and 

D(A)  =  exp  J-  OjXl/j} 

for  all  l A |  sufficiently  small.  Since  from  Theorem  4. l  the  asymptotic  characteristic 
function  c(u)  of  nan  is  given  by  D ' ** ( 2ui ) , 

c(u)  =  exp^E^  0j(2ui)i/Jj? . 

Finally,  this  gives  the  cumulant  generating  function,  valid  for  all  I u  I  sufficiently  small 
(Cramer,  1946,  pp.  185-6), 

log  c(u)  =  %  £  *j( 2ui)J/J  =  "  kj(1u)1/J«. 

J=l  1=1  J 

Equating  coefficients  of  uJ  then  yields 

kj  -  (J-l)l  2J'1  Sj  .  (7.1) 

If  the  weighting  function  w(u)  in  equation  (1.3)  is  chosen  so  that  nan  can  be 
explicitly  integrated,  as  in  examples  (b)  and  (c)  of  Section  5,  then  the  evaluation  of 
the  first  several  integrals  in  may  often  be  performed  by  straightforward,  if 
somewhat  tedious,  calculations.  We  will  subsequently  need  expressions  for  the 


asymptotic  cuaulants  so  derived  for  (b)  and  (c)  at  Section  5.  We  rscord  tbs  results 


for  the  Gaussian  case  only. 

(a)  If  tbs  underlying  population  1s  normal  with  mean  8  and  variance  a*,  and 
dw(u)  *  a  exp(  -a*u*  )du ,  o  >  0,  then,  putting  y  *  ~  , 

{l-(y*+l)'%}  (7.2a) 

Kj  =  4fr((eya+4)-%  -  2(3y4+«ya+4)*%  +  2(y*+l  )** )  (7.2b) 

r,  *  16frs/*((9y*+i2y*+4r%  -  3(  2y*+10y*+12y*+4)-%  (7.2c) 

•f  3(  3y*+lly4+l2ya+4)*^  -  (  4y*4-12y4+4)‘%) 
k4  -  192ff*(t4(ya+l)(2ya+l)%]-1  -  4((  3y*+2  )*(  y*+2  )*  -  4y*(y*+i)*J-% 

♦  2C(y*4-l )(  4<y*+l  )*-2y4 )%  J  "1  +  2t(3y*+2)(y*+2))_1  (7. 2d) 

-  2C(y*+l)((3y*+2)(y*+2))%)*1  *  C2(y*+l)J**} 

It  seems  natural  to  approximate  the  null  distribution  of  with  that  of  a 

weighted  chi-squared  variate,  or  perhaps  the  sum  of  several  such  random  variables, 
the  weights  and  degrees  at  freedom  associated  with  the  approximating  variates  being 
chosen  in  such  a  way  that  the  first  several  cuaulants  of  nan  are  matched.  Three  such 
procedures  have  been  investigated! 

(a)  PatnalK's  (1949)  two-cusulant  y*  approximation 

The  null  distribution  at  nan  is  approximated  by  that  of  the  variate  Q  *  ax$»  a 
and  v  are  determined  by  matching  the  first  two  cuaulants  at  nan  with  those  of  Q. 
This  requires  a  *  kx/2k1,  v  =  2kJ/k4 .  Then,  given  any  x  >  0, 

P(na„  «  x)  Z  P( x*  * 


and  0|  determined  from  (7.1). 

In  every  situation  to  which  this  approximation  has  been  applied,  equations  (7.5)  and 
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(7.6)  have  yielded  real  and  positive  values  for  ax,  a2,  vx  and  v2a 

Direct  Inversion  of  the  characteristic  function  of  Z,  In  (7.3),  gives,  for  vx  + 
v2  >  2  (Gradshteyn  and  Ryzhlk,  1965,  p.  320,  3.394-7),  an  expression  for  the 
density  of  Z  which  may  be  Integrated  termwise  and  rearranged  to  yield  the  distribution 
function  Gz(x)  of  Z  as 


G7(„  =  <«./»a>*  2  5  r(%(v^M) 
71  ’  r(%v2)  kSo  it  * 


t1  •  n]"  (sr)  <7-7> 


where  Vv(  ■ )  is  the  distribution  function  of  a  chi  -squared  variate  with  v  degrees  of 
freedom  and  ax  Is  the  smaller  of  the  weights  ax  and  ax.  The  above  development 
allows  one  to  approximate  the  distribution  of  nan  by  P(nan  a  x)  3  gz(x) . 

If  the  underlying  population  is  specified  to  be  normal  with  mean  6  and  variance 
a*  under  the  null  hypothesis,  and  If  dw(u)  *  omxp(  -a*uz)du  is  chosen  in  the 
definition  of  nan,  then  equations  (7.2)  may  be  used  to  compute  cumulant  matching 
approximations  of  the  three  types  that  have  been  described  for  the  null  asymptotic 


distribution  of  the  test  statistic.  Results  are  presented  in  Table  l  for  the  case  where 

y  «  -  *  1.  Also  included  is  the  distribution  of  na^  calculated  by  means  of 
approximating  quadratures  as  In  example  (c)  of  section  5.  Quadratures  of  order  48 
and  64  yielded  Identical  results  for  all  entries  In  Table,  which  are  thought  to  give  the 
true  distribution  of  nan  to  within  +  10'*. 

It  can  be  seen  that  the  distribution  of  the  weighted  sum  of  two  chi-squared 


variates  provides  an  excellent  approximation  for  the  asymptotic  distribution  of  nan, 
particularly  for  large  x.  The  Pearson  approximation  gives  results  which  are  almost  as 
good  In  the  upper  tail,  although  as  must  be  expected  it  is  not  very  accurate  in  the 
vicinity  of  the  origin.  The  Pearson  and  sum  of  2x*’s  approximations  appear  to  be 
adequate  for  most  practical  purposes,  at  least  if  the  desired  size  of  the  test  is  not 
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TABLE  1 


Approximations  to  th#  Asymptotic  Mull 

"""W*  to  HoJTT"<r/t  .  , 


Patnaik 


Psarson 


0.1 

0.2 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

0.9 

1.0 

1.1 

1.2 

1.3 

1.4 

1.5 

1.6 

1.7 

1.8 
1.9 
2.0 
2.1 
2.2 

2.3 

2.4 

2.5 

2.6 

2.7 

2.8 
2.9 
3.0 


.1833 

.3277 

.4453 

.5417 

.6210 

.6865 

.7405 

.7851 

.8220 

.8526 

.8778 

.8988 

.9161 

.9304 

.9423 

.9522 

.9603 

.9671 

.9727 

.9774 

.9812 

.9844 

.9871 

.9893 

.9911 

.9926 

.9939 

.9949 

.9958 

.9965 


.1021 

.3239 

.4627 

.5651 

.6444 

.7072 

.7579 

.7990 

.8327 

.8604 

.8833 

.9023 

.9181 

.9313 

.9422 

.9514 

.9591 

.9655 

.9710 

.9755 

.9793 

.9825 

.9853 

.9875 

.9895 

.9911 

.9925 

.9936 

.9946 

.9954 


Sum  of  2  x*'i 


P(na„*x) 


.1431 

.3024 

.4376 

.5466 

.6332 

.7019 

.7564 

.7999 

.8350 

.8633 

.8864 

.9052 

.9207 

.9336 

.9442 

.9530 

.9604 

.9666 

.9718 

.9761 

.9798 

.9828 

.9854 

.9876 

.9895 

.9911 

.9924 

.9935 

.9945 

.9953 


.1396 

.3072 

.4426 

.5494 

.6341 

.7014 

.7554 

.7988 

.8339 

.8624 

.8857 

.9047 

.9204 

.9334 

.9441 

.9531 

.9605 

.9667 

.9719 

.9762 

.9799 

.9829 

.9855 

.9877 

.9896 

.9911 

.9925 

.9936 

.9945 

.9953 
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A  similar  comparison  at  approximations  results  w hen  the  underlying  population  is 
Cauchy  with  location  parameter  8  and  seal*  parameter  a  dw(u)  =  a  exp(  -alu  i  )du  and 
tha  ratio  y  *  <r/a  was  taken  ovar  a  range  of  values  including  unity.  The  rasults  ware 
tha  saaa  as  for  tha  normal  cars. 

Tha  cuaulant  matching  approximations  wars  also  investigated  whan  tha  distribution 
specified  under  tha  null  hypothesis  Is  tha  stable  law  at  index  %  whose  characteristic 
function  is 

♦o(u)  *  exp( - lul *  (1-1  sgn  u)). 

Tha  statistic  considered  was  tha  discrete  approximation  to 

nan  *  n  P  lin(u)  -  bo(u)|*  e‘|u|du 
J  -00 

given  by 

.  .  p  . 

na^P '  =  n  E  l<Muk>  *  *o<uk)l1  wk 
k*l 

where  uk  and  wk,  k  *  1,2, ,  60  are  tha  (two-sided)  Laguerre  integration  abscissas 
auid  weights.  The  natural  weight  function  is  exp(-lul^)  but  exp(-lul)  was  used  for 
computational  convenience .  The  cumulants  are  obtained  from  (5.8).  The  agreement 
between  the  asymptotic  null  distribution  given  by  Theorem  3.1  and  tha  approximation  is 
not  quite  as  good  as  in  tha  Gaussian  and  Cauchy  examples  but  still  tha  maxi  mi  im 
deviation  between  the  exact  and  approximate  distribution  determined  by  tha  weighted 
sum  of  two  independent  x*  variates  is  only  .008  and  in  the  upper  decile  is  at  mo* 
.003.  Pearson's  approximation  again  performs  quite  well  in  the  upper  tall  and  would 
probably  be  preferred  due  to  its  simplicity. 

Our  computations  indicate  that  cumulant- matching  approximations  provide  a  suitable 
means  for  the  evaluation  at  critical  values  required  for  use  in  tests  of 
goodness -of -fit.  Since  such  approximations  do  not  require  excessive  computation  or 


analyst! ,  t hay  make  the  uaa  o f  tha  atatiatic  faaaibla  froa  a  practical  point  at  view 
ovar  a  wide  range  at  situations . 

8.  RATE  OP  CONVERGENCE 

3  Inca  tha  derivation  at  tha  finite  sample  distribution  at  tha  variate  appears 
to  be  Intractable,  any  inferences  to  be  drawn  through  observation  of  this  atatiatic  will 
ordinarily  be  baaed  on  lta  asymptotic  properties.  Inferences  could  also  be  based  on 
tha  finite  sample  cumulants  (2.4)  and  (2.5),  obtained  in  special  cases  by  explicit 
integration  or  in  general  by  numerical  integration,  and  application  of  Patnalk's 
approximation.  This  would  not  be  difficult  in  principle  but  could  be  avoided  if 
convergence  of  the  finite  sample  distributions  is  sufficiently  rapid  so  that  the  statistic 
is  at  value  for  moderate  sasqple  sixes. 

TO  see  whether  the  null  distribution  of  nan  is  well  -approximated  by  its  asymptotic 
distribution  for  moderate  n,  nasties  from  each  of  the  normal  (y=l) ,  Cauchy  (y=l) , 
and  stable  populations  considered  in  the  previous  section  were  simulated.  Empirical 
distribution  functions  at  nan  (na^P)  for  the  stable  case),  based  on  either  2000  or 
12000  replications,  were  generated  for  sasiple  sixes  n  of  5,  15,  25,  and  50. 

Comparison  of  empirical  and  asymptotic  distribution  functions  was  made  by  the 
usual  chi -squared  statistic  applied  in  each  set  of  replications  to  teat  the  hypothesis 
that  it  was  drawn  from  the  associated  asymptotic  distribution.  The  probability  levels 
corresponding  to  these  tests  are  given  in  Table  2.  Based  on  this  empirical  evidence 
the  finite  sample  else  distribution  of  nan  do  seem  to  converge  rapidly,  and  use  of  the 
asymptotic  distribution  when  sample  else  is  greater  than  or  equal  to  5,  say,  would 
appear  to  be  justified. 


*  c  ‘  * 

'  :-f.' 
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